Abstract. The spectrum of Q0055-269 (z = 3.66) has been observed at a
resolution of 14 km s⁻¹ in the wavelength interval 4750-6300 Å. The statistical
distribution of the Doppler parameter for the Lyα lines is peaked at b ~ 23 km
s⁻¹. The column density distribution is described by a power law with a break at
log N_HI ~ 14.5. Significant clustering, with ξ ~ 1 at Δv = 100 km s⁻¹, is detected
for lines with log N_HI > 13.8. Two voids of size ~ 2000 km s⁻¹ are found in the
spectrum with a probability of 2 × 10⁻⁴.

1 The clustering properties of the Lyα clouds

No clustering in velocity space has been detected so far for Lyα lines
on scales 300 < Δv < 30000 km s⁻¹ (Sargent et al. 1980, 1982; Webb &
Barcons 1991). Their spatial distribution is different from that of the
metal-line systems selected by means of the C IV doublet, which are known to cluster
on scales at least up to 600 km s⁻¹ (Sargent et al. 1988). Preliminary results
at higher resolution seem to indicate weak clustering on smaller scales (Δv =
50-300 km s⁻¹; Webb 1987; Chernomordik, this conference).

The larger the density of the observed lines, the better the chances of
detecting a clustering signal. For this reason we have selected a QSO at
redshift 3.66, Q0055-269, which has been observed at ESO La Silla with the
NTT telescope and the EMMI instrument in echelle mode (Cristiani et
al. 1995). The weighted mean of the flux-calibrated spectra has a resolution
R ~ 22000. The signal-to-noise ratio per resolution element at the continuum
level ranges from S/N = 12 to 40 in the interval 4750-6300 Å.
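
The resolving power quoted here and the velocity resolution quoted in the abstract can be checked against each other with the standard relation Δv = c / R. The short sketch below does only that arithmetic; the function name is ours and not part of the reduction described in the text.

```python
C_KM_S = 299_792.458  # speed of light in km/s


def velocity_resolution(resolving_power: float) -> float:
    """Velocity resolution in km/s for a resolving power R = lambda / d_lambda."""
    return C_KM_S / resolving_power


dv = velocity_resolution(22_000)
print(f"R = 22000 corresponds to {dv:.1f} km/s")
```

The result, about 13.6 km s⁻¹, agrees with the 14 km s⁻¹ given in the abstract.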

Column densities and Doppler widths of the Lyα lines and metal-line
systems have been derived by a fitting procedure. The statistical distribution
of the Doppler parameter for the Lyα lines is peaked at b ~ 23 km s⁻¹. The
column density distribution is described by a power law with a break or cutoff
at log N_HI ~ 14.5. A featureless distribution is rejected at the 99.94%
confidence level.

Fig. 1. Two-point correlation function for the Lyα lines in Q0055-269. The
continuous histogram is for lines with log N_HI > 13.8; the dotted histogram is for
lines with log N_HI > 13.3. The continuous curve is the model described in Sec. 2
and computed from eq. (2) with γ = 1.77, σ_v = 150 km s⁻¹, r_cl = 110 kpc and
r_0 = 280 kpc at z = 3.3.

The two-point correlation function in velocity space is defined as

$$\xi(v, \Delta v) = \frac{N_{obs}(v, \Delta v)}{N_{exp}(v, \Delta v)} - 1 \tag{1}$$

where N_obs is the observed number of line pairs with velocity separations
between v and v + Δv and N_exp is the number of pairs expected from a random
distribution in redshift. For our line sample N_exp is obtained by averaging 1000
numerical simulations generated according to the cosmological distribution,
a power law in (1 + z) whose index is derived from the maximum likelihood
analysis of the real line sample in the given interval of column density and
redshift. In this way it is possible to correct for incomplete wavelength
coverage due to gaps in the spectrum or occultation of weak lines by strong
complexes. No velocity splittings Δv < 20 km s⁻¹ were included because of the
intrinsic line blending due to the typical widths of the Lyα lines. The 1σ
deviation from a random distribution is given by N_obs^(-1/2), a good
approximation in the case of weak clustering, ξ < 1. The resulting correlation
function of the Lyα lines in Q0055-269 with log N_HI > 13.3 is shown in
Figure 1. A weak but significant signal is present, with ξ ~ 0.34 ± 0.06 up
to 350 km s⁻¹.
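
The estimator described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code used for the analysis: the velocity splitting of a pair is taken as c |z₁ - z₂| / (1 + z̄), the random redshifts follow a pure power law in (1 + z) with a hypothetical index, and the corrections for spectral gaps and for weak lines hidden by strong complexes are left out. All function names and the demo values are ours.

```python
import numpy as np

C_KM_S = 299_792.458  # speed of light in km/s


def pair_velocities(z):
    """Velocity splittings (km/s) of all distinct pairs of absorption redshifts."""
    z = np.asarray(z, dtype=float)
    i, j = np.triu_indices(z.size, k=1)
    return C_KM_S * np.abs(z[i] - z[j]) / (1.0 + 0.5 * (z[i] + z[j]))


def draw_redshifts(n, z_min, z_max, index, rng):
    """Draw n redshifts with dN/dz proportional to (1+z)**index (index != -1)."""
    p = index + 1.0
    lo, hi = (1.0 + z_min) ** p, (1.0 + z_max) ** p
    return (lo + rng.random(n) * (hi - lo)) ** (1.0 / p) - 1.0


def correlation_function(z_obs, bins, z_min, z_max, index, n_sim=1000, seed=0):
    """Return xi = N_obs / N_exp - 1 and the 1-sigma error N_obs**-0.5 per bin."""
    rng = np.random.default_rng(seed)
    n_obs = np.histogram(pair_velocities(z_obs), bins=bins)[0]
    n_exp = np.zeros(len(bins) - 1)
    for _ in range(n_sim):
        z_sim = draw_redshifts(len(z_obs), z_min, z_max, index, rng)
        n_exp += np.histogram(pair_velocities(z_sim), bins=bins)[0]
    n_exp /= n_sim  # average pair counts over the random realisations
    with np.errstate(divide="ignore", invalid="ignore"):
        xi = n_obs / n_exp - 1.0
        sigma = 1.0 / np.sqrt(n_obs)
    return xi, sigma


# Demo on a synthetic, unclustered line list (illustrative values only).
rng = np.random.default_rng(1)
z_demo = draw_redshifts(150, 2.95, 3.60, 2.0, rng)
bins = np.arange(20.0, 1020.0, 100.0)  # first edge at 20 km/s, as in the text
xi, sigma = correlation_function(z_demo, bins, 2.95, 3.60, 2.0, n_sim=200)
print(np.round(xi, 2))
```

For an unclustered input such as the demo list, ξ should scatter around zero within the quoted 1σ errors; a real line list would replace `z_demo`.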

Exploring the variations of the correlation function as a function of the
column density, an increase in the first velocity bin appears as the column
density threshold is raised. The maximum significant signal is obtained
for lines with log N_HI > 13.8, for which the correlation function is
ξ = 0.89 ± 0.18 for Δv = 100 km s⁻¹. When computing ξ for lines in the
interval log N_HI = 13.3-13.6 (including in the simulated spectra gaps due
to lines with log N_HI > 13.8), no significant clustering is observed.

For comparison the correlation function of the Lyα samples obtained from
the QSOs PKS 2126-158 (Giallongo et al. 1993) and Q0014+81 (Rauch et al.
1992) has been computed. Significant clustering at Δv = 100 km s⁻¹
appears when the samples are limited to strong lines (with a lower significance
due to the smaller density of lines in the individual spectra): in PKS 2126-158
ξ = 1.02 ± 0.26 is obtained for lines with log N_HI > 13.8, in Q0014+81
ξ = 0.85 ± 0.30 for lines with log N_HI > 14.1.

The correlation found for the Lyα cloud positions is less pronounced than
for metal-line absorption systems or galaxies, but consistent with a scenario of
gravitationally induced correlations, as expected in models where gravitation
is an important confining agent (Mo, Miralda-Escudé, & Rees 1993). The
correlation found would imply, in the standard CDM framework, that the
Lyα clouds responsible for the weakest absorption are abundant in underdense
regions. The redshift autocorrelation function could be lower than the spatial
autocorrelation.



2 The dimensions of the Lyα clouds

At small velocity differences the three-dimensional spatial autocorrelation
function ξ_r and the redshift autocorrelation function ξ_v are related by the
convolution (Heisler, Hogan, & White 1989):



$$\xi_v = \int_0^\infty H\,dr\,\xi(r)\,P(v|r) \;\propto\; \int^\infty \frac{H\,dr}{\sigma_v}\left(\frac{r}{r_0}\right)^{-\gamma}\cdots \tag{2}$$

where a Gaussian distribution of the peculiar motions with respect to the 
Hubble flow is assumed together with a power-law spatial correlation function 
of the galaxy-type. 
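
To see how the ingredients of the convolution shape the redshift correlation function, it can be evaluated numerically. The sketch below is one possible reading of the assumptions stated above and not a reproduction of the fit in Fig. 1: P(v|r) is taken as a Gaussian of dispersion σ_v centred on the Hubble velocity Hr, the spatial correlation as (r/r_0)^(-γ), and the integral is started at the cloud size r_cl (our assumption for the lower limit). The Hubble parameter used in the demo is a hypothetical value chosen only for illustration, and the result is unnormalised.

```python
import numpy as np


def xi_v(v, gamma, sigma_v, r_cl, r0, hubble, n_steps=20000):
    """Unnormalised redshift correlation at velocity splitting v.

    v, sigma_v in km/s; r_cl, r0 in Mpc; hubble in km/s/Mpc.
    Assumes a Gaussian P(v|r) centred on hubble * r and xi(r) = (r/r0)**-gamma.
    """
    # Beyond this radius the Gaussian factor is negligible (8 sigma away from v).
    r_max = (abs(v) + 8.0 * sigma_v) / hubble
    if r_max <= r_cl:
        return 0.0
    r = np.linspace(r_cl, r_max, n_steps)
    xi_r = (r / r0) ** (-gamma)
    gauss = np.exp(-0.5 * ((v - hubble * r) / sigma_v) ** 2)
    gauss /= np.sqrt(2.0 * np.pi) * sigma_v
    f = hubble * xi_r * gauss
    # Trapezoidal rule written out explicitly.
    return float(np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(r)))


# Parameters of the curve in Fig. 1, with a hypothetical Hubble parameter.
H_DEMO = 450.0  # km/s/Mpc, illustrative only
for v in (100.0, 300.0, 1000.0):
    print(v, xi_v(v, gamma=1.77, sigma_v=150.0, r_cl=0.11, r0=0.28, hubble=H_DEMO))
```

Changing `sigma_v` while keeping the other parameters fixed shows the flattening of the curve at large velocity dispersions.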

At small velocity splittings, the redshift correlation scale r_σ depends mainly
on the cloud sizes r_cl and on the velocity dispersion. Although these quantities
are poorly known, it is instructive to derive constraints on the cloud
sizes and velocity dispersions from the observed correlation. Assuming an index
γ = 1.77 for the power-law spatial correlation function, the fit shown in
Fig. 1 provides upper limits on the cloud sizes as a function of the velocity
dispersion and of the appropriate correlation scale. We obtain r_cl ~ 100-190
kpc for σ_v = 50-160 km s⁻¹, and r_0 ~ 245-420 kpc, respectively. In the
former case a good fit to the overall shape of the observed correlation function
requires a very low value, γ = 1.1. For σ_v > 200 km s⁻¹ the redshift
correlation function becomes too flat and very steep γ values are needed (e.g.
σ_v = 300 km s⁻¹ requires γ = 4).

3 Voids in the Lyα forest

Voids in the Ly a forest provide a test for models of the large-scale structure, 
even if changes in the IGM due to fluctuations of the UV ionizing flux can 
be efficient in depleting neutral hydrogen along the line of sight. 

Searches for megaparsec-sized voids have produced a couple of claims: a
void with comoving size ~ 30 Mpc in the spectrum of Q0420-388 (Crotts
1987, 1989), whose statistical significance has been questioned (Ostriker,
Bajtlik, & Duncan 1988); another one in Q0302-003, with a size ~ 23 Mpc
(Dobrzycki & Bechtold 1991). The statistical significance of a given void depends
exponentially on the line density (Ostriker et al. 1988) and uncertainties in
the line statistics strongly influence the probability estimate. High resolution
data, less affected by blending effects, are ideal in this respect.

We have searched for gaps in a sample of Lyα lines of Q0055-269 limited, to
ensure statistical completeness and accuracy in the deblending, to log N_HI >
13.3 and b < 30 km s⁻¹. Two regions devoid of such lines have been found,
centred at λ ~ 5000 and 5206 Å, with sizes Δv ~ 2009 and 2046
km s⁻¹ respectively (i.e. ~ 20 Mpc). To establish the random probability of observing
such gaps, a set of 50000 spectra has been simulated with the observed
number (N = 178) of redshifts randomly generated in the same interval as
the data (excluding the region within 8 Mpc from the quasar) according to
the cosmological distribution. The probability of getting a gap larger than the
maximum observed one is then ~ 0.02. The 2046 km s⁻¹ gap is significant only
at the 2.2σ level, but the joint probability for the presence of the two observed
gaps in the same spectrum is ~ 2 × 10⁻⁴. Even if underdense regions are
statistically significant in our spectrum, the filling factor is less than 10%.
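
The Monte Carlo test for the largest gap can be sketched as follows. This is a simplified illustration, not the simulation used for the numbers above: the redshift interval and the evolution index are hypothetical placeholders, gaps between the edges of the interval and the first or last line are ignored, and only the probability of the single largest gap is estimated (not the joint probability of two gaps). Function and parameter names are ours.

```python
import numpy as np

C_KM_S = 299_792.458  # speed of light in km/s


def gap_probability(n_lines, z_min, z_max, index, gap_kms, n_sim=50000, seed=0):
    """Fraction of random spectra whose largest line-free gap exceeds gap_kms.

    Redshifts follow dN/dz proportional to (1+z)**index (index != -1).
    """
    rng = np.random.default_rng(seed)
    p = index + 1.0
    lo, hi = (1.0 + z_min) ** p, (1.0 + z_max) ** p
    # One simulated spectrum per row, drawn by inverse-CDF sampling.
    z = (lo + rng.random((n_sim, n_lines)) * (hi - lo)) ** (1.0 / p) - 1.0
    z.sort(axis=1)
    z_mid = 0.5 * (z[:, 1:] + z[:, :-1])
    dv = C_KM_S * np.diff(z, axis=1) / (1.0 + z_mid)
    return float(np.mean(dv.max(axis=1) > gap_kms))


# Illustrative call: 178 lines, hypothetical interval and index, fewer spectra.
p_gap = gap_probability(178, 2.95, 3.60, 2.0, 2046.0, n_sim=5000)
print(f"P(max gap > 2046 km/s) = {p_gap:.3f}")
```

Because the probability depends exponentially on the line density, the result is sensitive to the adopted interval and number of lines, which is why the placeholders should be replaced by the actual sample limits before any comparison with the quoted values.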